Variable time-scale modification of speech using transient information
نویسندگان
چکیده
Conventional time-scale modification methods have the problem that as the modification rate gets higher the time-scale modified speech signal becomes less intelligible, because they ignore the effect of articulation rate on speech characteristics. In this paper, we propose a variable time-scale modification method based on the knowledge that the timing information of transient portions of a speech signal plays an important role in speech perception. After identifying transient and steady portions of a speech signal, the proposed method gets the target rate by modifying steady portions only. The result of subjective preference test indicates that the proposed method produces performance superior to that of the conventional SOLA method.
منابع مشابه
Computationally Effic Modification of Speech Us
Among the conventional time-scale modification methods [1][6], the synchronized overlap and add (SOLA) method [4] is used widely because of its good performance with relatively low computational complexity. But the SOLA method still requires much computation in evaluating the normalized crosscorrelation function for synchronization procedure [9]. In this paper, we employ 3 level center clipping...
متن کاملTransient Natural Convection in an Enclosure with Variable Thermal Expansion Coefficient and Nanofluid Properties
Transient natural convection is numerically investigated in an enclosure using variable thermal conductivity, viscosity, and the thermal expansion coefficient of Al2O3-water nanofluid. The study has been conducted for a wide range of Rayleigh numbers (103≤ Ra ≤ 106), concentrations of nanoparticles (0% ≤ ϕ ≤ 7%), the enclosure aspect ratio (AR =1), and temperature differences between the cold a...
متن کاملPractical high-quality speech and voice synthesis using fixed frame rate ABS/OLA sinusoidal modeling
This paper describes algorithms developed to apply the Analysis-by-Synthesis/Overlap-Add (ABS/OLA) sinusoidal modeling system to real-time speech and singing voice synthesis. As originally proposed, the ABS/OLA system is limited to unidirectional timescaling, and relies on variable frame length to accomplish time-scale modification. For speech and voice synthesis applications, unidirectional ti...
متن کاملA Speaking Rate Normalization Method Using Time-Scale Modification for Speech Recognition
In this paper, we propose a speaking rate normalization method by selecting a scaling factor of time-scale modification for speech recognition. It is shown from the speech recognition experiments that the proposed method reduces average word error rate compared to that without using any speaking rate normalization.
متن کاملEffects of Pitch Contours Stylization and Time Scale Modification on Natural Speech Synthesis
This paper describes the method of generation of intonated speech for natural speech synthesis using prosody generation model. The effect of pitch modification through pitch contour stylization for parameter extraction and time scale modification for it’s implementation has been mentioned. An approach for close-copy syllabic stylization has been described. In the latter part, algorithm for impl...
متن کامل